Streamline similarity analysis using bag-of-features
نویسندگان
چکیده
Streamline similarity comparison has become an active research topic recently. We present a novel streamline similarity comparison method inspired by the bag-of-features idea from computer vision. Our approach computes a feature vector, spatially sensitive bag-of-features, for each streamline as its signature. This feature vector not only encodes the statistical distribution of combined features (e.g., curvature and torsion), it also contains the information on the spatial relationship among different features. This allows us to measure the similarity between two streamlines in an efficient and accurate way: the similarity between two streamlines is defined as the weighted Manhattan distance between their feature vectors. Compared with previous distribution based streamline similarity metrics, our method is easier to understand and implement, yet producing even better results. We demonstrate the utility of our approach by considering two common tasks in flow field exploration: streamline similarity query and streamline clustering.
منابع مشابه
SubSift: a novel application of the vector space model to support the academic research process
SubSift matches submitted conference or journal papers to potential peer reviewers based on the similarity between the paper’s abstract and the reviewer’s publications as found in online bibliographic databases such as Google Scholar. Using concepts from information retrieval including a bag-of-words representation and cosine similarity, the SubSift tools were originally created to streamline t...
متن کاملProper Noun Semantic Clustering Using Bag-of-Vectors
In this paper, we propose a model for semantic clustering of entities extracted from a text, and we apply it to a Proper Noun classification task. This model is based on a new method to compute the similarity between the entities. Indeed, the classical way of calculating similarity is to build a feature vector or Bag-of-Features for each entity and then use classical similarity functions like C...
متن کاملTaxonomy-based Document Clustering
AbstrAct: One well-known document representation for text clustering is bag-of-words. Although it is simple and popular, it ignores semantics, underly ing linguistic information, and word correlations. In this paper, Bag-Of-Queries, a new document representation is proposed. First, a taxonomy of the terms in the local dictionary derived for data set is extracted. Ex tracting taxonomy is perform...
متن کاملVisual Analysis of Transport Similarity in 2D CFD Ensembles
Currently, there are no methods of visual analysis for ensemble vector fields (EVF) that provide identification of flow trends and general flow similarity over the extent of transport across ensemble members. Finite-time Variance Analysis (FTVA) provides flow structure information only on particle distributions at the termination of streamline integration. In this paper, we first present a flow...
متن کاملA Content-based Music Similarity Retrieval Scheme by Using BoW Representation and LSH-based Retrieval
This extended abstract paper presents detailed information about a content-based music similarity retrieval scheme, which is based on locality sensitive hashing (LSH). Our scheme considered MFCC and time histogram (TH) as two major features to represent the properties of audio music similarity. Next, each feature is depicted by Bag of Words (BoW), which k-means clustering summarizes extracted f...
متن کامل